(54) Document retrieval system

(57) Linguistic features extracted from the query are classified into concepts expressing the content of the
query, and linguistic features extracted from the document are classified into concepts expressing the content
of the document. The user confirms of which concepts the input statement and the document are comprised and
which kind of linguistic feature corresponds to each concept, and the system assists the user in adequately
selecting a document which is closer to the intention of retrieval.

Description

BACKGROUND OF THE INVENTION

[Field of the Invention]

The present invention relates to a document retrieval system for retrieving a document suitable for a user's intention 
of retrieval from electronic documents. 

[Description of the Related Art]

The following two retrieval systems have been proposed in the past: a retrieval system based on the exact match
technique, in which a user inputs queries comprising character strings and logic operators and obtains a document
assembly which satisfies the condition, and a retrieval system based on the partial match technique, in which the
input by the user and the documents to be retrieved are compared and collated with each other by some measure of
similarity, and the documents are ranked according to their closeness to the user's intention of retrieval.

The retrieval based on the exact match technique is advantageous in that the query inputted by the user clearly
corresponds to the document assembly obtained as the result of retrieval. However, the vocabulary used in the
documents to be retrieved is often unclear to the user, so it is difficult to specify a suitable keyword in the
query, and in case a large document assembly has been obtained as the result of retrieval, it is hard for the user
to pick out the relevant documents one by one.

On the other hand, the retrieval based on the partial match technique is advantageous in that ranking is assigned
to the documents as the result of retrieval according to the closeness to the user's intention of retrieval, while
it is not necessarily clear to the user what kind of documents are ranked on what basis.

SUMMARY OF THE INVENTION

In the present invention, linguistic features expressing the content of the query and of the document are
classified into concepts, each of which expresses part of that content. This clarifies of which concepts the query
and the document are comprised, shows which linguistic features of the document to be retrieved actually correspond
to each of those concepts, and supports the user in efficiently selecting the document which is closer to the
user's intention of retrieval.

To solve the above problems, the document retrieval system according to the present invention comprises an
input/output control unit for receiving input from a user and for showing the result of processing to the user, a
user request processing unit for receiving a request other than a query among the inputs from the user and for
processing the content of the request, a document storage unit for storing a document to be retrieved, a feature
extracting unit for extracting a linguistic feature of the query inputted from said input/output control unit as an
input feature or for extracting a linguistic feature of the document stored in the document storage unit as a
document feature, an input feature storage unit for storing the input feature extracted from the query by the
feature extracting unit, a document feature storage unit for storing the linguistic feature taken out from the
document by the feature extracting unit, a feature classifying unit for classifying the input feature stored in the
input feature storage unit to correspond to a concept of the content expressed by the query or for classifying the
document feature stored in the document feature storage unit to correspond to a concept of the content expressed by
the document, an input feature classification storage unit for storing the correspondence between the input feature
classification and the input feature as the result of classification of the input feature by the feature
classifying unit, and a document feature classification storage unit for storing the correspondence between the
document feature classification and the document feature as the result of classification of the document feature by
the feature classifying unit, whereby, in response to a request from the user via the user request processing unit,
the content of the query or the specified document is presented to the user via the input/output control unit as a
corresponding relationship between the group of concepts expressed by the input feature classification and the
document feature classification and the linguistic features belonging to each concept group.

According to the present invention, the user can easily confirm of which concepts the query or the document is
comprised and, further, which linguistic feature corresponds to each concept, and hence can efficiently select the
document which is closer to the intention of retrieval.

The system according to Claim 1 of the present invention comprises an input/output control unit for receiving input
from a user and for presenting the result of processing to the user, a user request processing unit for receiving a
request other than a query among the inputs from the user and for processing the content of the request, a document
storage unit for storing a document to be retrieved, a feature extracting unit for extracting a linguistic feature
of the query inputted from said input/output control unit as an input feature or for extracting a linguistic feature
of the document stored in the document storage unit as a document feature, an input feature storage unit for storing
the input feature extracted from the query by the feature extracting unit, a document feature storage unit for
storing the linguistic feature taken out from the document by the feature extracting unit, a feature classifying
unit for classifying the input feature stored in the input feature storage unit to correspond to a partial concept
of the content expressed by the input statement or for classifying the document feature stored in the document
feature storage unit to correspond to a concept of the content expressed by the document, an input feature
classification storage unit for storing the correspondence between the input feature classification and the input
feature as the result of classification of the input feature by the feature classifying unit, and a document feature
classification storage unit for storing the correspondence between the document feature classification and the
document feature as the result of classification of the document feature by the feature classifying unit, and, in
response to the request sent from the user via the user request processing unit, the contents of the query or the
specified document are presented to the user via the input/output control unit as a corresponding relationship
between the group of concepts expressed by the input feature classification and the document feature classification
and the linguistic features belonging to each concept group. Because the correspondence between the concept
expressing the content of the query by the input feature classification and the input feature belonging to each
input feature classification, as well as the correspondence between the concept expressed by the document feature
classification of the document and the document feature belonging to each document feature classification, are
presented in response to the request of the user, the system helps the user in easily recognizing of which concepts
the input statement or the document is comprised and, further, which kind of linguistic feature corresponds to each
concept.

In the system according to Claim 2 of the present invention, in response to the request from the user, the input
feature classification stored in the input feature classification storage unit and the correspondence between the
input feature classification and the input feature, the document feature classification of the specified document
stored in the document feature classification storage unit and the correspondence between the document feature
classification and the document feature, and further the document feature stored in the document feature storage
unit are added, deleted or corrected, whereby the contents of the input statement and the document are expressed
more adequately.

The system according to Claim 3 of the present invention comprises a feature collating unit for collating the input
feature classification stored in the input feature classification storage unit and the corresponding input feature
with the document feature classification stored in the document feature classification storage unit and the
corresponding document feature, and a collating result storage unit for storing the collating method and result of
the feature collating unit, whereby the contents of the query and the document are compared and collated using the
concept expressed by the input feature classification and the input feature belonging to the concept, and the
concept expressed by the document feature classification and the document feature belonging to the concept, and, as
the result of calculation of similarity between the query and the document, not only the score and the ranking of
each of the documents finally obtained but also the method and result of collating of the similarity are presented
to the user, thereby demonstrating to the user from which viewpoint the collating has been performed and how each
document has been evaluated.

In the system according to Claim 4 of the present invention, when the query is collated with the accumulated
documents, a specific document feature classification and document feature selected by the user via the user request
processing unit are treated as a provisional input feature classification or input feature, the feature collating
unit collates them with the document feature classification and the document feature of each document, and the
result of collating is presented to the user. Thus, if there is any concept or linguistic feature which does not
appear in the original input feature classification or input feature but is regarded as adequate as an input feature
classification or input feature, the specific document feature classification and document feature are provisionally
regarded as the input feature classification and the input feature and collated with the documents, and the effect
can be easily confirmed by the user.

In the system according to Claim 5 of the present invention, when the query is collated with the accumulated
documents, weight is set to the input feature classification or the input feature by the degree of importance
specified by the user via the user request processing unit, the feature collating unit collates the input feature
classification or the input feature thus weighted with the document feature classification or the document feature
of each document, and the result of collating is presented to the user, whereby the user gives the degree of
importance to the input feature classification or the input feature as a concept constituting the input statement or
as a linguistic feature, so that the intention of retrieval is clearly demonstrated and the accuracy of retrieval is
increased.


BRIEF DESCRIPTION OF THE DRAWINGS 

The object and features of the present invention will become more readily apparent from the following detailed
description of the preferred embodiments taken in conjunction with the accompanying drawings in which:


Fig. 1 is a block diagram showing functional arrangement of a document retrieval system in Embodiment 1 of the
present invention;

Fig. 2 shows an example of query in Embodiment 1 of the present invention;

Fig. 3 gives examples of data of an input feature storage unit in Embodiment 1 of the present invention;

Fig. 4 represents an example of hierarchical thesaurus in Embodiment 1 of the present invention;

Fig. 5 shows a first example of data of an input feature classification storage unit in Embodiment 1 of the
present invention;

Fig. 6 shows examples of data of a document feature classification storage unit in Embodiment 1 of the present
invention;

Fig. 7 shows a first example of a document ranking table in Embodiment 1 of the present invention;

Fig. 8 shows a second example of the data of the input feature classification storage unit in Embodiment 1 of the
present invention;

Fig. 9 shows a second example of a document ranking table in Embodiment 1 of the present invention;

Fig. 10 shows an example of provisional input feature classification and input features in Embodiment 1 of the
present invention;

Fig. 11 shows a third example of a document ranking table in Embodiment 1 of the present invention;

Fig. 12 shows a third example of data of the input feature classification storage unit in Embodiment 1 of the
present invention;

Fig. 13 gives a fourth example of data of the input feature classification storage unit in Embodiment 1 of the
present invention; and

Fig. 14 shows a fourth example of the document ranking table in Embodiment 1 of the present invention.

DETAILED DESCRIPTION OF THE INVENTION

In the following, description will be given on embodiments of the present invention referring to Fig. 1 to Fig. 14.

Fig. 1 is a block diagram showing functional arrangement of a document retrieval system according to an embodiment
of the present invention. In Fig. 1, reference numeral 11 represents an input/output control unit, 12 is a user
request processing unit, 13 is a data storage unit, 14 is a document feature storage unit, 15 is an input feature
storage unit, 16 is a document feature classification storage unit, 17 is an input feature classification storage
unit, 18 is a collating result storage unit, 19 is a document storage unit, 20 is a feature extracting unit, 21 is a
feature classifying unit, and 22 is a feature collating unit.

In the following, description will be given on operation of the document retrieval system with the above
arrangement. First, a query described in natural language is inputted from a user via the input/output control unit
11. The feature extracting unit 20 analyzes the input statement, extracts important words and phrases as linguistic
features and, if necessary, stores these in the input feature storage unit 15 together with statistical information
such as the frequency of appearance of these features and the degree of importance. It is also possible to describe
these input features by expanding them to homonyms, synonyms, narrower terms, etc. using a thesaurus.

Fig. 2 shows an example of a query. Fig. 3 summarizes examples of data stored in the input feature storage unit 15
in case unnecessary words such as symbols, particles, etc. are excluded from the words obtained through
morphological analysis of the query of Fig. 2 and the remaining words are regarded as input features.
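
As a rough illustration of this extraction step, the following Python sketch drops unnecessary words and records the frequency of the remaining ones. It is only a sketch under stated assumptions: a simple regular-expression tokenizer stands in for morphological analysis, and the stop-word list and the sample query are hypothetical and are not taken from Fig. 2 or Fig. 3.

```python
import re
from collections import Counter

# Hypothetical stop list standing in for "symbols, particles, etc."
STOP_WORDS = {"a", "an", "the", "of", "to", "in", "and", "or", "how", "i", "want", "know"}

def extract_features(text):
    """Return a Counter mapping each remaining word to its frequency of appearance."""
    # Assumption: lower-cased alphabetic tokens approximate morphological analysis.
    tokens = re.findall(r"[a-z]+", text.lower())
    return Counter(token for token in tokens if token not in STOP_WORDS)

# Hypothetical query, not the query of Fig. 2.
query = "I want to know how to stew vegetables and how to simmer fish"
input_features = extract_features(query)
print(dict(input_features))
```

The same function could be applied to each stored document to obtain document features, which mirrors the way the feature extracting unit 20 is described as handling both the query and the documents.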

On the other hand, the feature extracting unit 20 analyzes each of the documents stored in the document storage
unit 19 in a similar manner. Important words and phrases are extracted as linguistic features and, if necessary,
these are stored in the document feature storage unit 14 together with statistical information such as the frequency
of appearance of these features and the degree of importance.

When the feature extracting unit 20 extracts important words and phrases, information such as, for example, the
frequency of the words, the distribution of the words among documents, parts of speech, the position of appearance
in the document, and the syntactic and semantic relationship with other words is used for the judgment of the degree
of importance of the words.
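
The text does not fix a particular formula for the degree of importance. One common way to combine two of the listed signals, the frequency of a word and its distribution among documents, is TF-IDF weighting; the sketch below uses it purely as an assumed stand-in, and the function name and the sample documents are hypothetical.

```python
import math
from collections import Counter

def importance(docs):
    """docs maps a document id to its list of words; returns {doc_id: {word: tf * idf}}."""
    n_docs = len(docs)
    # Number of documents in which each word appears at least once.
    doc_freq = Counter()
    for words in docs.values():
        doc_freq.update(set(words))
    result = {}
    for doc_id, words in docs.items():
        term_freq = Counter(words)
        result[doc_id] = {
            word: term_freq[word] * math.log(n_docs / doc_freq[word])
            for word in term_freq
        }
    return result

# Hypothetical documents, already reduced to feature words.
docs = {1: ["stew", "fish", "fish"], 2: ["stew", "rice"]}
scores = importance(docs)
print(scores[1])
```

With this weighting a word that appears in every document ("stew" here) receives importance 0, while a word concentrated in one document is ranked higher. The other signals named in the text (part of speech, position, relationships with other words) are not modeled in this sketch.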

Next, the feature classifying unit 21 classifies the input features stored in the input feature storage unit 15 so
that they correspond to each concept which comprises the query, attaches a classification name and stores the result
in the input feature classification storage unit 17. To classify the input features, it is supposed that the feature
classifying unit 21 possesses hierarchical thesauruses. Nodes at a given depth in the hierarchical thesauruses are
set as criteria for classification, and the words having semantically close concepts under the same node are put
together. There is also a method of putting together specific words having a close syntactic or semantic
relationship using a co-occurrence dictionary or a concept dictionary. Fig. 4 gives an example of a part of the
hierarchical thesaurus possessed by the feature classifying unit 21.
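
The depth-based grouping can be sketched as follows. This is a minimal illustration under assumptions: the thesaurus fragment, the convention that a root node has depth 0, and the "unclassified" bucket for words missing from the thesaurus are all hypothetical and do not reproduce Fig. 4.

```python
# Hypothetical thesaurus fragment (child -> parent); None marks a root node.
PARENT = {
    "food": None,
    "vegetable": "food", "fish": "food",
    "carrot": "vegetable", "radish": "vegetable",
    "mackerel": "fish", "cod": "fish",
    "cooking": None,
    "heating": "cooking",
    "stew": "heating", "boil": "heating",
}

def path_from_root(word):
    """Return the list of nodes from the root down to the given word."""
    path = []
    while word is not None:
        path.append(word)
        word = PARENT[word]
    return path[::-1]

def classify(features, depth):
    """Group features under their ancestor at the given depth (root = depth 0)."""
    groups = {}
    for word in features:
        if word not in PARENT:
            name = "unclassified"  # assumption: words missing from the thesaurus
        else:
            path = path_from_root(word)
            # A word that sits above the requested depth is filed under itself.
            name = path[min(depth, len(path) - 1)]
        groups.setdefault(name, []).append(word)
    return groups

groups = classify(["carrot", "mackerel", "stew", "boil", "quickly"], depth=1)
print(groups)
```

Choosing a shallower depth merges the groups into broader concepts (depth 0 would place "carrot" and "mackerel" together under "food"), which corresponds to selecting a broader node as the criterion for classification.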

As the classification name expressing the coherence or grouping of the input features of a certain group, a word is
used whose concept corresponds to the node at a given depth in the hierarchical thesaurus possessed by the feature
classifying unit 21 and which is a broader concept of a plurality of words in the group. In addition, there is a
method of using one of the homonyms or synonyms in the thesaurus, or a method of using one of the words belonging to
the same input feature classification. Fig. 5 shows examples of data stored in the input feature classification
storage unit 17, such as the input features corresponding to each input feature classification, obtained using the
broader concept 1 as the criterion for classification in the hierarchical thesaurus of Fig. 4.

Further, the feature classifying unit 21 also classifies the document features stored in the document feature
storage unit 14 so that they correspond to the concepts comprising the document, attaches a classification name and
stores the result in the document feature classification storage unit 16. The method of classifying the document
features and of determining the classification name is the same as in the classification of the input features
above. Fig. 6 shows examples of data stored in the document feature classification storage unit 16, such as the
correspondence between the document feature classification and the document feature. In Fig. 6, the document feature
classifications common to the input feature classifications and the document features common to the input features
are enclosed by the symbol TT, and identifiers such as "A", "B", etc. are attached to the input feature
classifications and the document feature classifications. These notations and symbols will also be used in the
description hereinafter.

Next, the feature collating unit 22 collates the document feature classification and the corresponding document
feature stored in the document feature classification storage unit 16 with the input feature classification and the
corresponding input feature stored in the input feature classification storage unit 17 and determines the ranking of
the documents. The results of the collating of the documents and a document ranking table indicating the ranking are
stored in the collating result storage unit 18, and the document ranking table is presented to the user via the
input/output control unit 11.

Description will be given on an example of collating of the document feature classification and the document feature
with the input feature classification and the input feature, referring to the examples of the data in the input
feature classification storage unit 17 shown in Fig. 5 and to the examples of the data of the document feature
classification storage unit 16 given in Fig. 6.

As an example of a method to calculate the score of each document, an evaluation function is used as follows:

Equation 1:

$$E(a) = \sum \bigl(\text{weight of the document feature classification to which the document feature belongs} \times \text{weight of the document feature} \times \text{frequency of appearance of the document feature in the document } a\bigr)$$

where E(a) is the score of the document a and the sum is taken over the document features of the document a.



In case the input feature classification and the input feature are not weighted, the document feature classification
and the document feature are evaluated depending upon whether the corresponding input feature classification and
input feature are present or not. As the method to determine the weight of each document feature classification and
document feature, it is supposed that weight is given according to:

Rule 1:

(1a) to give weight 1 to the document feature classification having the corresponding input feature classification;

(1b) to give weight 1 to the document feature of each document feature classification having the corresponding
input feature; and

(1c) to give weight 0 to all of the document feature classifications and document features other than (1a) and (1b)
above.

Because the in-document frequency of appearance of the document features of Fig. 6 is 1 in all cases, the scores of
the documents are obtained by the equation 1 as follows:

Document 1:

E (1) = 1 x 1 x 1 + 1 x 1 x 1 + 1 x 1 x 1 + 1 x 1 x 1 +
        0 x 0 x 1 + 0 x 0 x 1 + 0 x 0 x 1 + 0 x 0 x 1 +
        0 x 0 x 1 + 0 x 0 x 1 + 0 x 0 x 1
      = 4

Document 2:

E (2) = 1 x 1 x 1 + 0 x 1 x 1 + 0 x 0 x 1 + 0 x 0 x 1 +
        1 x 1 x 1 + 0 x 1 x 1 + 1 x 1 x 1 + 0 x 1 x 1 + 0 x 1 x 1
      = 3

Document 3:

E (3) = 1 x 1 x 1 + 0 x 1 x 1 + 1 x 1 x 1 + 1 x 1 x 1 +
        1 x 1 x 1 + 0 x 1 x 1 + 1 x 1 x 1 + 1 x 1 x 1 +
        0 x 1 x 1 + 0 x 1 x 1 + 0 x 1 x 1
      = 6



As a result, the ranking is: document 3 - document 1 - document 2. 
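
The three sums above can be checked mechanically. The sketch below evaluates the equation 1 over lists of (classification weight, feature weight, in-document frequency) triples transcribed from the worked sums; the data layout and the function name are assumptions made for illustration, and the underlying features of Fig. 6 are not reproduced here.

```python
def score(terms):
    """Equation 1: sum of classification weight x feature weight x in-document frequency."""
    return sum(class_w * feature_w * freq for class_w, feature_w, freq in terms)

# Triples transcribed from the worked sums for documents 1 to 3.
documents = {
    1: [(1, 1, 1)] * 4 + [(0, 0, 1)] * 7,
    2: [(1, 1, 1), (0, 1, 1), (0, 0, 1), (0, 0, 1),
        (1, 1, 1), (0, 1, 1), (1, 1, 1), (0, 1, 1), (0, 1, 1)],
    3: [(1, 1, 1), (0, 1, 1), (1, 1, 1), (1, 1, 1),
        (1, 1, 1), (0, 1, 1), (1, 1, 1), (1, 1, 1),
        (0, 1, 1), (0, 1, 1), (0, 1, 1)],
}

scores = {doc_id: score(terms) for doc_id, terms in documents.items()}
# Rank documents from the highest score to the lowest.
ranking = sorted(scores, key=scores.get, reverse=True)
print(scores, ranking)
```

Running it gives the scores 4, 3 and 6 for documents 1, 2 and 3, which yields the same order as stated in the text.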

Fig. 7 shows examples of the document ranking table in the above case. In Fig. 7, the subtotal of the scores of the
document features of each document is given as a partial score, and the same applies to all of the subsequent
ranking tables. Although it is not given in Fig. 7, the frequency of appearance of the document feature in each
document or the weight given to the document feature classification and the document feature can also be given in
the ranking table.

Also, ranking can be given by putting weight on the input feature classification and the input feature. For example,
for the examples of the data of the input feature classification storage unit 17 shown in Fig. 5, it is supposed
that the user specifies, via the user request processing unit 12, that the weight of the input feature
classification C is 1, the weight of the input features belonging to the input feature classification C is 1, and
the weight of the other input feature classifications and input features is 0, and that collating is then performed
with the documents having the document feature classifications and the document features shown in Fig. 6.

Fig. 8 shows examples of the data of the input feature classification storage unit 17 corrected as the result of the
weighting applied to the data of the input feature classification storage unit of Fig. 5 at the request of the user.

In the data of the input feature classification storage unit 17 of Fig. 8, the frequency of the input feature is 1
in all cases, and its description is omitted. The same applies to the explanation of the data of the input feature
classification storage unit 17 hereinafter. It is supposed that weight is given to the document feature
classification and the document feature according to:



Rule 2:

(2a) to the document feature classifications having a corresponding input feature classification, give the weight of
the corresponding input feature classification;

(2b) to the document features having a corresponding input feature, give the weight of the corresponding input
feature; and

(2c) to give weight 0 to all of the document feature classifications and the document features other than (2a) and
(2b) above.

Because the in-document frequency of the document feature of each document is 1 as given in Fig. 6, when the score
of each document is calculated by the calculation method shown in the above equation 1, the ranking of the documents
1 to 3 is given as: document 3 - document 2 - document 1.
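
How the weighted rule feeds the equation 1 can be sketched as below: each document feature classification and document feature inherits the weight of its corresponding input feature classification or input feature, and anything without a counterpart gets weight 0. The dictionaries, the classification letters and the features used here are hypothetical and do not reproduce Fig. 6 or Fig. 8.

```python
def weighted_score(doc, class_weights, feature_weights):
    """Score one document under the weighted rule.

    doc maps a classification to {feature: in-document frequency};
    class_weights maps a classification to the user-specified weight;
    feature_weights maps (classification, feature) to the user-specified weight.
    """
    total = 0
    for classification, features in doc.items():
        # Inherit the input classification weight, or 0 if there is no counterpart.
        class_w = class_weights.get(classification, 0)
        for feature, freq in features.items():
            # Inherit the input feature weight, or 0 if there is no counterpart.
            feature_w = feature_weights.get((classification, feature), 0)
            total += class_w * feature_w * freq
    return total

# Hypothetical user weighting: only classification C and its feature "stew" count.
class_weights = {"A": 0, "C": 1}
feature_weights = {("A", "carrot"): 0, ("C", "stew"): 1}

# Hypothetical documents.
doc_x = {"A": {"carrot": 1}, "C": {"stew": 1, "boil": 1}}
doc_y = {"A": {"carrot": 1}}

print(weighted_score(doc_x, class_weights, feature_weights))
print(weighted_score(doc_y, class_weights, feature_weights))
```

In this toy data only the "stew" feature of classification C contributes, so the first document scores 1 and the second scores 0. Setting every present weight to 1 reduces the same function to the unweighted rule.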

Fig. 9 summarizes an example of the document ranking table obtained as the result. In Fig. 9, the document feature
classifications and the document features with weighting are enclosed by the symbol "[ ]".

Also, it is possible to give ranking of documents by treating a document feature classification and document
features as a provisional input feature classification and input features. For example, it is supposed that the user
requests to collate, among the document features belonging to the document feature classification C in the documents
2 and 3 in Fig. 6, the document features "boil", "steam" and "dress" as provisional input features with the
documents of Fig. 6 and that this has been input via the user request processing unit 12. Fig. 10 shows examples of
the provisional input feature classification and the input features to be stored in the input feature classification
storage unit 17 in case the weight of the provisional input feature classification C is 1 and the weight of each of
the provisional input features "boil", "steam" and "dress" is 1.

The weights of the document feature classification and the document feature are determined according to the above
Rule 2. When the ranking of the documents is calculated using the evaluation function of the equation 1, supposing
that the in-document frequency of the document feature of each document is 1, the ranking is: document 3 - document
2 - document 1. The ranking table thus obtained is given in Fig. 11.

The user can add a document feature classification having no corresponding input feature classification, or a
document feature having no corresponding input feature, as an input feature classification or an input feature. For
example, it is supposed that the user inputs a request via the user request processing unit 12 that the document
features "boil", "steam" and "dress" belonging to the document feature classification C of the documents 2 and 3 in
Fig. 6 should be added to the input feature classification C of Fig. 5. Because the above three features specified
by the user, among the document features of the document feature classification C, are not included in the input
feature classification C, these are added to the input feature classification storage unit 17 as new input features
of the input feature classification C.

Further, in case the user inputs, via the user request processing unit 12, a request that the document feature
classification F of the document 2 of Fig. 6 and the document features "mackerel" and "Spanish mackerel" be added as
an input feature classification and its input features, these are added to the input feature classification storage
unit 17 as a new input feature classification and its input features, because there are none corresponding to the
document feature classification F and its document features in the input feature classifications and the input
features of Fig. 5.

Fig. 12 shows an example of data of the input feature classification storage unit 17 after correction, which is
obtained by adding the document feature classification F and its document features and some of the document features
of the document feature classification C to the data of the input feature classification storage unit of Fig. 5. In
Fig. 12, the newly added input feature classification and the newly added input features are enclosed by the symbols
^*'.
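
The two kinds of addition described above (new features for an existing classification, and an entirely new classification) can be sketched with one small helper. The starting content of classification C is a hypothetical placeholder rather than the data of Fig. 5; the added features follow the examples in the text.

```python
def add_features(input_classes, classification, features):
    """Add features to a classification, creating it if absent; return the newly added ones."""
    existing = input_classes.setdefault(classification, [])
    added = []
    for feature in features:
        if feature not in existing:  # features already present are left untouched
            existing.append(feature)
            added.append(feature)
    return added

# Hypothetical initial content of the input feature classification storage.
input_classes = {"C": ["stew"]}

# Add features to the existing classification C, then add the new classification F.
added_c = add_features(input_classes, "C", ["stew", "boil", "steam", "dress"])
added_f = add_features(input_classes, "F", ["mackerel", "Spanish mackerel"])
print(input_classes)
```

Returning the list of newly added items makes it straightforward to mark them in a presentation such as the one described for Fig. 12.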

Also, ranking is given by newly adding weight to the corrected input feature classifications or input features. For
example, it is supposed that the user sees the data of the input feature classification storage unit 17 corrected as
shown in Fig. 12 and wants to retrieve by selecting the documents relating to "stewing vegetables or fishes". The
user sets the weight of the input feature classifications A and F of Fig. 12 as 5, the weight of the input features
belonging to the input feature classifications A or F as 1, the weight of the input feature classification C as 5,
the weight of the feature "stew" among the input features belonging to the input feature classification C as 10, the
weight of the input features other than "stew" belonging to the input feature classification C as 0, and the weight
of the other input feature classifications and input features as 0. Fig. 13 shows the data of the input feature
classification storage unit 17 thus corrected.

The weights of the document feature classification and the document feature are determined according to the above
Rule 2. If it is supposed that the in-document frequency of the document feature of each document is 1 and the
ranking of the documents is calculated using the evaluation function of the equation 1, the ranking is: document 2 -
document 3 - document 1. Fig. 14 shows the document ranking table thus obtained.

As described above, according to the present invention, the contents of the query and the document are presented to
the user as a corresponding relationship between the partial concepts expressing each of the contents and the
linguistic features belonging to each concept. If necessary, the user performs collating after adding corrections
and weighting to each of the concepts and linguistic features, and the collating method and the result of collating
are easily confirmed. As a result, it is possible to obtain advantageous effects, i.e. to efficiently select the
document closer to the user's intention of retrieval and to perform retrieval with high accuracy.

It should be understood that the foregoing relates to only preferred embodiments of the present invention, and that
it is intended to cover all changes and modifications of the embodiments of the invention herein used for the
purpose of the disclosure which do not depart from the spirit of the invention.

Linguistic features extracted from the query are classified into concepts expressing the content of the query, and
linguistic features extracted from the document are classified into concepts expressing the content of the document.
The user confirms of which concepts the input statement and the document are comprised and which kind of linguistic
feature corresponds to each concept, and the system assists the user in adequately selecting a document which is
closer to the intention of retrieval.

Claims

1. A document retrieval system, comprising:

an input/output control unit for receiving input from a user and for presenting result of processing to the user;

a user request processing unit for accepting request other than query among the inputs from the user, and for
processing content of the request;

a document storage unit for storing a document to be retrieved;

a feature extracting unit for extracting linguistic feature of the input statement inputted from said input/output
control unit and for extracting linguistic feature of the document stored in said document storage unit as document
feature;

an input feature storage unit for storing input feature extracted from the query by said feature extracting unit;

a document feature storage unit for storing linguistic feature taken out of the document by said feature extracting
unit;

a feature classifying unit for classifying the input features stored in said input feature storage unit so that it
corresponds to concept of the content expressed by the query, or for classifying the document feature stored in said
document feature storage unit so that it corresponds to concept of the contents expressed by the document;

an input feature classification storage unit for storing correspondence of the input feature classification and the
input feature as a result of classification of the input feature by said feature classifying unit; and

a document feature classification storage unit for storing correspondence of the document feature classification
with the document feature as a result of classification of the document feature by said feature classifying unit,
whereby:

in response to a request sent from the user via the user request processing unit, the contents of the query or the
specified document are presented to the user via said input/output control unit as corresponding relationship of
group of concepts expressed by the input feature classification and the document feature classification with the
linguistic feature belonging to each concept group.

2. A document retrieval system according to Claim 1, wherein, in response to a request of correction from the user via
the user request processing unit, correspondence of the query with the input feature classification stored in the
input feature classification storage unit, the input feature classification and the input feature, and correspondence
of the accumulated document with the document feature classification stored in the document feature classification
storage unit and the document feature classification and the document feature as well as the document feature
stored in the document feature storage unit are corrected.

3. A document retrieval system according to Claim 1 or 2, wherein there are provided a feature collating unit for
collating the input feature classification stored in the input feature classification storage unit and the corresponding
input feature with the document feature classification stored in the document feature classification storage unit and
the corresponding document feature, and a collating result storage unit for storing method and result of collating by
said feature collating unit.

4. A document retrieval system according to Claim 3, wherein, when the input statement is collated with the accumulated
document, a specific document feature classification and document feature selected by the user via the user
request processing unit are treated as provisional input feature classification and input feature, and collated with
the document feature classification and the document feature of each document by the feature collating unit, and
the result of the collating is presented to the user.

5. A document retrieval system according to Claim 3 or 4, wherein, when the query is collated with the accumulated
document, weight is set to the input feature classification and the input feature depending upon the degree of
importance specified by the user via the user request processing unit, and said weighted input feature classification
or input feature are collated with the document feature classification or the document feature of each document by
said feature collating unit, and the result of the collating is presented to the user.
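
The collating of Claims 3 and 5 can be illustrated with a minimal sketch. The claims do not specify how the collation is computed, so everything below is an assumption made for illustration: each classification is held as a mapping from a concept label to a set of features, concepts in the query and the document are matched by identical label, the score of a concept is the fraction of the query's features in that concept that also occur in the document, and the user's degree of importance is applied as a per-concept multiplier (weighting of individual features is omitted). The names `collate`, `rank`, `doc_a` and `doc_b`, and the two sample documents, are hypothetical. The query values follow FIG. 12 without its asterisked entries.

```python
from typing import Dict, List, Optional, Set

# Hypothetical representation: concept label -> set of linguistic features.
Classification = Dict[str, Set[str]]


def collate(query: Classification,
            document: Classification,
            weights: Optional[Dict[str, float]] = None) -> Dict[str, float]:
    """Score each query concept against the same-named document concept.

    Assumed scoring rule (not taken from the claims): the share of the
    query's features in a concept that the document also contains,
    multiplied by the user-specified weight (default 1.0).
    """
    weights = weights or {}
    result: Dict[str, float] = {}
    for concept, query_features in query.items():
        document_features = document.get(concept, set())
        shared = query_features & document_features
        overlap = len(shared) / len(query_features) if query_features else 0.0
        result[concept] = weights.get(concept, 1.0) * overlap
    return result


def rank(query: Classification,
         documents: Dict[str, Classification],
         weights: Optional[Dict[str, float]] = None) -> List[str]:
    """Order document names by total collation score, highest first."""
    scored = [(sum(collate(query, doc, weights).values()), name)
              for name, doc in documents.items()]
    # Ties are broken alphabetically by document name.
    return [name for _, name in sorted(scored, key=lambda p: (-p[0], p[1]))]


# Query classification following FIG. 12 (asterisked entries omitted).
QUERY: Classification = {
    "VEGETABLE": {"RADISH", "CARROT", "POTATO", "KIDNEY BEANS"},
    "COOKING UTENSILS": {"SINGLE-HANDLE POT", "TWO-HANDLE POT", "FRYING PAN"},
    "COOKING METHOD": {"STEW", "BAKE", "FRY"},
}

# Hypothetical document classifications, invented for this example.
DOCUMENTS: Dict[str, Classification] = {
    "doc_a": {"VEGETABLE": {"RADISH", "CARROT"},
              "COOKING METHOD": {"STEW"}},
    "doc_b": {"VEGETABLE": {"POTATO"},
              "COOKING UTENSILS": {"FRYING PAN"},
              "COOKING METHOD": {"FRY", "BAKE"}},
}

if __name__ == "__main__":
    print(rank(QUERY, DOCUMENTS))
    print(rank(QUERY, DOCUMENTS, weights={"VEGETABLE": 4.0}))
```

Under these assumptions, the unweighted ranking places doc_b first, and raising the weight of the VEGETABLE concept moves doc_a to the top, since doc_a shares more vegetable features with the query. Because the scores are kept per concept, the result of collating can be shown to the user concept by concept.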
FIG. 2

RADISH, CARROT, POTATO OR KIDNEY BEANS ARE BOILED USING
SINGLE-HANDLE OR TWO-HANDLE POT, OR BAKED OR FRIED USING
FRYING PAN.



FIG. 3

Input feature        Frequency

RADISH               1
CARROT               1
POTATO               1
KIDNEY BEANS         1
SINGLE-HANDLE POT    1
TWO-HANDLE POT       1
STEW                 1
FRYING PAN           1
BAKE                 1
FRY                  1



FIG. 4

FIG. 6

FIG. 8

FIG. 12



INPUT FEATURE CLASSIFICATION    INPUT FEATURE

A: VEGETABLE                    RADISH, CARROT, POTATO, KIDNEY BEANS
B: COOKING UTENSILS             SINGLE-HANDLE POT, TWO-HANDLE POT, FRYING PAN
C: COOKING METHOD               STEW, BAKE, FRY, *BOIL*, *STEAM*, *DRESS*
F: *FISH*                       *MACKEREL*, *SPANISH MACKEREL*
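
The correspondence shown in FIG. 12 can be held in a simple mapping, which is enough to sketch the presentation of Claim 1 and a correction in the sense of Claim 2. This is an illustrative sketch only: the data structure, the function names `concept_of`, `present` and `reassign`, and the concept label "G: LEGUME" used in the example are hypothetical and do not come from the document. Asterisked entries are stored verbatim as they appear in the figure, with no assumption about what the asterisks denote.

```python
from typing import Dict, List, Optional

# Hypothetical representation: classification label -> ordered list of features.
Table = Dict[str, List[str]]

# Contents of FIG. 12, copied verbatim (asterisks included).
FIG12: Table = {
    "A: VEGETABLE": ["RADISH", "CARROT", "POTATO", "KIDNEY BEANS"],
    "B: COOKING UTENSILS": ["SINGLE-HANDLE POT", "TWO-HANDLE POT", "FRYING PAN"],
    "C: COOKING METHOD": ["STEW", "BAKE", "FRY", "*BOIL*", "*STEAM*", "*DRESS*"],
    "F: *FISH*": ["*MACKEREL*", "*SPANISH MACKEREL*"],
}


def concept_of(table: Table, feature: str) -> Optional[str]:
    """Return the classification label that holds the feature, if any."""
    for label, features in table.items():
        if feature in features:
            return label
    return None


def present(table: Table) -> str:
    """Render each concept with its features, one concept per line."""
    return "\n".join(f"{label} | {', '.join(features)}"
                     for label, features in table.items())


def reassign(table: Table, feature: str, new_label: str) -> Table:
    """Return a corrected copy with the feature moved under new_label.

    The original table is left untouched; classifications that become
    empty after the move are dropped from the copy.
    """
    updated = {label: [f for f in features if f != feature]
               for label, features in table.items()}
    updated.setdefault(new_label, []).append(feature)
    return {label: features for label, features in updated.items() if features}


if __name__ == "__main__":
    print(present(FIG12))
    # Hypothetical user correction: move a feature to a new concept.
    corrected = reassign(FIG12, "KIDNEY BEANS", "G: LEGUME")
    print(present(corrected))
```

Returning a corrected copy rather than modifying the table in place keeps the earlier classification available, so both versions can be shown to the user for comparison.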
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